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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Nelson, Edward L. 

Nelson, Peter J. 

(ii) TITLE OF INVENTION: NOVEL VECTOR FOR 
POLYNUCLEOTIDE VACCINES 

(iii) NUMBER OF SEQUENCES: 2 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: MORGAN & FINNEGAN, L.L.P. 

(B) STREET: 345 PARK AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10154 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.5 INCH, 1.44 MB 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WORDPERFECT 5.1 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/143 06 

(B) FILING DATE: 14 -AUG- 1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US60/023931 

(B) FILING DATE: 14-AUG-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: KATHRYN M. BROWN 

(B) REGISTRATION NUMBER: 34556 

(C) REFERENCE/DOCKET NUMBER: 2026-4236US1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 758-4800 

(B) TELEFAX: (212) 751-6849 

(C) TELEX: 421792 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: CDNA 
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(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 : 

GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT 4 0 

GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 80 

GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 120 

TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG 16 0 

CTTACCGGAT ACCTCTCCGC CTTTCTCCCT TCGGGAAGCG 200 

TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC 240 

GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 280 

CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT 320 

ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 360 

ACTGGCAGCA GCCACTGGTA ACAGGATTAG C AG AG CG AGG 40 0 

TATGTAGGCG GTG CTACAGA GTTCTTGAAG TGGTGGCCTA 44 0 
ACTACGGCTA CAC 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 
CACCGCCTAC ATACCTCGCT CTGCTAATCC 
GGCTGCTGCC AGTGGCGATA AGTCGTGTCT 
GACTCAAGAC GATAGTTACC GGATAAGGCG 
GCTGAACGGG GGGTTCGTGC ACACAGCCCA 
AACGACCTAC ACCGAACTGA GATACCTACA 
TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 
GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA 
CCTGTCGGGT TTCGCCACCT CTGACTTGAG 
TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 
CAGCAACGCG GCC 



AACTCTGTAG 40 

TGTTACCAGT 80 

TACCGGGTTG 120 

CAGCGGTCGG I 60 

GCTTGGAGCG 200 

CCGTGAGCAT 240 

AAGGCGGACA 280 

GAGAGCGCAC 320 

TCTTTATAGT 3 60 

CGTCGATTTT 400 

GGAAAAACGC 440 

453 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 



(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 

GAATTCTTTC GGACTTTTGA AAGTGATGGT GGTGGCCGAA 
GGATTCGAAC CTTCGAAGTC GATGACGGCA GATTTAGAGT 
CTGCTCCCTT TGGCCGCTCG GGAACCCCAC CACGGGTAAT 
GCTTTTACTG GCCTGCTCCC TTATCGGGAA GCGGGGCGCA 
T C ATATC AAA TGACGCGCCG CTGTAAAGTG TTACGTTGAG 
AAAGAATTC 



(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 



(iv) ANT I - SENSE : No 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 

GAATTCTTTC TCAACGTAAC ACTTTACAGC GGCGCGTCAT 40 

TTGATATGAT GCGCCCCGCT TCCCGATAAG GGAGCAGGCC 80 

AGTAAAAGCA TT AC CCGTGG TGGGGTTCCC GAGCGGCCAA 120 

AGGGAGCAGA CTCTAAATCT GCCGTCATCG ACTTCGAAGG 160 

TTCGAATCCT TCCCCCACCA CCATCACTTT CAAAAGTCCG 200 

AAAGAATTC 209 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 

AATAAA 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 



ATTAAA 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7 



AGTAAA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 
AAGAAC 6 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 

AATACA 6 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 

GCCTTAAGGG CCATATGGTG AGTGGATCCC TTGACCCCAG 40 

GCGGGGATGG GGAGACCTGT AGTCAGAGCC CCCGGGCAGC 80 

ACAGGCCAAT GCCCGTCCTT CCCCTGCAGG ATGAGTAGTG 120 

AGTGCCTCTC CTGGCCCTGG AAGTTGCCAC TCCAGTGCCC 160 

ACCAGCCTTG TCCTAATAAA ATTAAGTTGC ATCATTTTGT 200 

CTGACTAGGT GTCCTCTATA ATATTAT 227 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 7 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

( i i i ) HYPOTHETICAL : No 

( iv) ANTI - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

ATAATATTAT AGAGGAC AC C TAGTCAGAAC AAATGATGCA 40 

ACTTAATTTT ATTAGGACAA GGCTGGTGGG CACTGGAGTG 80 

GCAACTTCCA GGGCCAGGAG AGGCACTCAC TACTCATCCT 120 

GCAGGGGAAG GACGGG C ATT GGCCTGTGCT GCCCGGGGGC 16 0 

TCTGACTACA GGTCTCCCCC ATCCCCGCCT GGGGTCAAGG 200 

CATCCACTCA CCATATGGCC CTTAAGG 227 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

CCTCGGTACC TGCCATGGCG CGGATTCTTT ATCACTGATA 4 0 

AGTTGGTGGA CATATTATGT TTATCAGTGA TAAAGTGTCA 80 

AGCATGACAA AGTTGCAGCC GAATACAGTG ATCCGTGCCG 12 0 

GCCCTGGACT GTTGAACGAG GTCGGCGTAG ACGGTCTGAC 16 0 

GACACGCAAA CTGGCGGAAC GGTTGGGGGT GCAGCAGCCG 200 

GCGCTTTACT GGCACTTCAG GAACAAGCGG GCGCCTTAAG 24 0 

GGCCATATGC CG 252 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:13: 

CCTCGGTACC TGCCACCATG GCGCGGATTC TTTAT 3 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI-SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 
CGGCATATGG CCTTAAGGCG CCCGCTTGTT CCTGAAGT 3 8 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 

GCCTTAAGGG CCATATGGTG AGTGGATGCC TTGACCCCAG 40 
GCGGGGATGG GGGAGACCTG TAGTCAGAGC CCCCGGGCAG 80 

CACAGGCCAA TGCCCGTCCT TCCCCTGCAG GATGAGTAGT 120 

GAGTGCCTCT CCTGGCCCTG GAAGTTGCCA CTCCAGTGCC 160 

CACCAGCCTT GTCCTAATAA AATTAAGTTG CATCATTTTG 200 

TCTGACTAGG TGTCCTCTAT AATATTAT 228 
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(2) INFORMATION FOR SEQ ID NO: 16: 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 



(ii) 



MOLECULE TYPE: 



cDNA 



(iii) 



HYPOTHETICAL: 



No 



(iv) 



ANTI- SENSE: 



NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:16: 



TGCCATGGCG CGGATTCTTT AT CACTGAT A AGTTGGTGGA 
CATATTATGT TTATCAGTGA TAAAGTGTCA AGCATGACAA 
AGTTGCAGCC GAATACAGTG ATCCGTGCCG GCCCTGGACT 
GTTGAACGAG GTCGGCGTAG ACGGTCTGAC GACACGCAAA 
CTGGCGGAAC GGTTGGGGGT GCAGCAGCCG GCGCTTTACT 
GGCACTTCAG GAACAAGCGG GCGCCTTAAG GGCCATATGG 
TGAGTGGATG CCTTGACCCC AGGCGGGGAT GGGGGAGACC 
TGTAGTCAGA GCCCCCGGGC AGCACAGGCC AATGCCCGT C 
CTTCCCCTGC AGTGAGTAGT GACTGCCCGG GTGGGATCCC 
TGTGACCCCT CCCCAGTGCC TCTCCTGGCC CTGGAAGTTG 
CCACTCCAGT GCCCACCAGC CTTGTCCTAA TAAAATTAAG 
TTGCATCATT TTGTCTGACT AGGTGTCCTC TATAATATTA 
TAAGCTTGAT ATCGAATTCT TTCTCAACGT AACACTTTAC 
AGCGGCGCGT CATTTGATAT GATGCGCCCC GCTTCCCGAT 
AAGGGAGCAG GCCAGTAAAA GCATTACCCG TGGTGGGGTT 
CCCGAGCGGC CAAAGGGAGC AGACTCTAAA TCTGCCGTCA 
TCGACTTCGA AGGTT CGAAT CCTTCCCCCA CCACCATCAC 
TTTCAAAAGT CCGAAAGAAT TCCTGCAGCC CGTGTAGCCG 
TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA 
CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC 
CAGTGGCGAT AAGTCGTGTC TTACCGGGTT GGACTCAAGA 
CGATAGTTAC CGGATAAGGC GCAGCGGTCG GGCTGAACGG 
GGGGTTCGTG CACACAGCCC AGCTTGGAGC GAACGACCTA 
CACCGAACTG AGATACCTAC AGCGTGAGCA TTGAGAAAGC 
GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 
TAAGCGGCAG GGTCGGAACA GGAGAGCGCA CGAGGGAGCT 
TCCAGGGGGA AACGCCTGGT ATCTTTATAG TCCTGTCGGG 
TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT 
CGTCAGGGGG GCGGAGCCTA TGGAAAAACG CCAGCAACGC 
GGCCGGGGGA TCCGGAGAGC TCACTCTAGA TG AG AG AG CA 
GTGAGGGAGA GACAGAGACT CGAATTTCCG GAGCTATTTC 
AGTTTTCTTT TCCGTTTTGT GCAATTTCAC TTATGATACC 

GGCCAATGCT TGGTTGCTAT TTTGGAAACT CCCCTTAGGG 
GATGCCCCTC AACTGGCCCT ATAAAGGGCC AGCCTGAGCT 
GCAGAGGATT CCTGCAGAGG ATCAAGACAG CACGTGGACC 
TCGCACAGCC TCTCCCACAG GTACC 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
640 
680 
720 
760 
800 
840 
880 
920 
960 
1000 
1040 
1080 
1120 
1160 
1200 
1240 
1280 

1320 
1360 
1400 
1425 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANT I - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 



ATGAGCAAGG GCGAGGAACT GTTCACTGGC 
TTCTCGTGGA ACTGGATGGC GATGTGAATG 
TTCTGTCAGC GGAGAGGGTG AAGGTGATGC 
AAGCTCACCC TGAAATTCAT CTGCACCACT 
CTGTGCCATG GCCAACACTG GTCACTACCT 
CGTGCAGTGC TTTTCCAGAT ACCCAGACCA 
CATGACTTTT TCAAGAGCGC CATGCCCGAG 
AGGAGAGAAC CATCTTTTTC AAAGATGACG 
GACCCGCGCT GAAGTCAAGT TCGAAGGTGA 
AATAGAATCG AGTTGAAGGG CATTGACTTT 
GAAACATTCT CGGCCACAAG CTGGAATACA 
CCACAATGTG TACATCATGG CCGACAAGCA 
AT C AAGGTCA ACTTCAAGAT CAGACACAAC 
GATCCGTGCA GCTGGCCGAC CATTATCAAC 
AATCGGCGAC CGCCCTGTGC TCCTCCCAGA 
CTGTCCACCC AGTCTGCCCT GTCTAAAGAT 
AGAGAGACCA CATGGTCCTG CTGGAGTTTG 
TGGGATCACA CATGG CATGG ACGAGCTGTA 



(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1911 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:18: 



GTGGTCCCAA 4 0 

GGCACAAATT 80 

CACATACGGA 12 0 

GGAAAGCTCC 160 

TCACCTATGG 2 00 

TATGAACGAG 24 0 

GGCTATGTGC 280 

GGAACTACAA 3 2 0 

CACCCTGGTG 3 6 0 

AAGGAAGATG 400 

ACTATAACTC 44 0 

AAAGAATGGC 480 

ATTGAGGATG 520 

AGAACACTCC 56 0 

CAACAATTAC 6 00 

CCCAACGAAA 640 

TGACCGCTGC 6 80 

CAAGTGAGC 719 
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TATGAGCAAG GGCGAGGAAC TG TT CACTGG CGTGGTCCCA 4 0 

ATTCTCGTGG AACTGGATGG CGATGTGAAT GGGCACAAAT 80 

TTTCTGTCAG CGGAGAGGGT GAAGGTGATG CCACATACGG 12 0 

AAAGCTCACC CTGAAATTCA TCTGCACCAC TGGAAAGCTC 16 0 

CCTGTGCCAT GGCCAACACT GGTCACTACC TTCACCTATG 200 

GCGTGCAGTG CTTTTCCAGA TACCCAGACC ATATGAAGCA 24 0 

GCATGACTTT TTCAAGAGCG CCATGCCCGA GGGCTATGTG 2 80 

CAGGAGAGAA CCATCTTTTT CAAAGATGAC GGGAACTACA 320 

AGACCCGCGC TGAAGTCAAG TTCGAAGGTG ACACCCTGGT 3 60 

GAATAGAATC GAGTTGAAGG GCATTGACTT TAAGGAAGAT 4 00 

GGAAACATTC TCGGCCACAA GCTGGAATAC AACTATAACT 44 0 

CCCACAATGT GTACATCATG GCCGACAAGC AAAAGAATGG 4 80 

CAT CAAGGT C AACTTCAAGA TCAGACACAA CATTGAGGAT 520 

GGATCCGTGC AGCTGGCCGA CCATTATCAA CAGAACACTC 56 0 

CAATCGGCGA CGGCCCTGTG CTCCTCCCAG ACAACCATTA 600 

CCTGTCCACC CAGTCTGCCC GTCTAAAGAT CCCAACGAAA 640 

AGAGAGACCA CATGGTCCTG CTGGAGTTTG TGACCGCTGC 6 80 

TGGGATCACA CATGG CATGG ACGAGCTGTA CAAGTGAGCC 720 

ATATGGTGAG TGGATGCCTT GACCCCAGGC GGGGATGGGG 76 0 

GAGACCTGTA GTCAGAGCCC CCGGGCAGCA CAGGCCAATG 800 

CCCGTCCTTC CCCTGCAGTG AGTAGTGACT GCCCGGGTGG 840 

GATCCCTGTG ACCCCTCCCC AGTGCCTCTC CTGGCCCTGG 880 

AAGTTGCCAC TCCAGTGCCC ACCAGCCTTG TCCTAATAAA 920 

ATTAAGTTGC ATCATTTTGT CTGACTAGGT GTCCTCTATA 960 

ATATTATAAG CTTGATATCG AATTCTTTCT CAACGTAACA 1000 

CTTTACAGCG GCGCGTCATT TGATATGATG CGCCCCGCTT 104 0 

CCCGATAAGG GAGCAGGCCA GTAAAAGCAT TACCCGTGGT 1080 

GGGGTTC CCG AGCGGCCAAA GGGAGCAGAC TCTAAATCTG 1120 

CCGTCATCGA CTTCGAAGGT TCGAATCCTT CCCCCACCAC H60 

CATCACTTTC AAAAGTCCGA AAGAATTCCT GCAGCCCGTG 12 00 

TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC 124 0 

CGC CTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 128 0 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC 1320 

TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT 13 60 

GAACGGGGGG TT CGTGC AC A CAGCCCAGCT TGGAGCGAAC 14 00 

GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA 1440 

GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 1480 

ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 1520 

GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT 1560 

GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT 1600 

GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 1640 

CAACGCGGCC GGGGG ATCCG GAGAGCTCAC TCTAGATGAG 1680 

AGAGCAGTGA GGGAGAGACA GAGACTCGAA TTTCCGGAGC 1720 

TATTT CAGTT TTCTTTTCCG TTTTGTGCAA TTTCACTTAT 1760 

GATACCGGCC AATGCTTGGT TGCTATTTTG GAAACTCCCC 1800 

TTAGGGGATG CCCCTCAACT GGCCCTATAA AGGGCCAGCC 184 0 

TGAGCTGCAG AGGATTCCTG CAGAGGATCA AGACAGCACG 1880 

TGGACCTCGC ACAGCCTCTC CCACAGGTAC C 1911 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 



Pro 


Asp 


Leu 


Ser 


Tyr 


Met 


Pro 


He 


Trp 


Lys 


Phe 


Pro 


1 






5 










10 






Asp 


Glu 


Glu 


Gly 


Ala 


Cys 


Gin 


Pro 


Cys 


Pro 


He 


Asn 




15 










20 










Cys 


Thr 


His 


Ser 


Cys 


Val 


Asp 


Leu 


Asp 


Asp 


Lys 


Gly 


25 










30 










35 




Cys 


Pro 


Ala 


Glu 


Gin 


Arg 


Ala 


Ser 


Pro 


Leu 


Thr 


Ser 






40 










45 








He 


He 


Ser 


Ala 


Val 


Val 


Gly 


He 


Leu 


Leu 


Val 


Val 




50 










55 










60 


Val 


Leu 


Gly Val 


Val 


Phe 


Gly 


He 


Leu 









65 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 20: 



Pro 


Ala 


Pro 


Gly Ala 


Gly Gly 


Met 


Val 


His 


His 


Arg 


1 








5 










10 






His 


Arg 


Ser 


Ser 


Ser 


Thr 


Arg 


Ser 


Gly 


Gly 


Gly 


Asp 




15 










20 










Leu 


Thr 


Leu 


Gly 


Leu 


Glu 


Pro 


Ser 


Glu 


Glu 


Glu 


Ala 


25 








30 










35 




Pro Arg 


Ser 


Pro 


Leu 


Ala 


Pro 


Ser 


Glu 


Gly 


Ala 


Gly 








40 










45 








Ser Asp 


Val 


Phe 


Asp 


Gly Asp 


Leu 


Gly 


Met 


Gly 


Ala 




50 










55 










60 


Ala 


Lys 


Gly 


Leu 


Ser 


Leu 


Pro 


Thr 


His 


Asp 


Pro 


Ser 






65 










70 






Pro 


Leu 


Gin 


Arg 


Tyr 


Ser 


Glu 


Asp 


Pro 


Thr 


Val 


Pro 






75 










80 










Leu 


Pro 


Ser 


Glu 


Thr 


Asp 


Gly Tyr 


Val 


Ala 


Pro 


Leu 


85 










90 










95 




Thr 


Cys 


Ser 


Pro 


Gin 


Pro 


Glu 


Tyr 


Val 


Asn 


Gin 


Pro 






100 










105 








Asp 


Val 


Arg 


Pro 


Pro 


Pro 


Ser 


Pro 


Arg 


Glu 


Gly 


Pro 


110 








115 










120 


Leu 


Pro 


Ala 


Ala 


Arg 


Pro 


Ala 


Gly 


Ala 


Thr 


Leu 


Glu 










125 










130 






Arg 


Pro 


Lys 


Thr 


Leu 


Ser 


Pro 


Gly 


Lys 


Asn 


Gly 


Val 




135 










140 
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Val 


j_j_y o 


7\ crj 


Val 


Phe 


Ala 


Phe 


Gly 


Gly 


Ala 


Val 


Glu 


145 






a. -j \j 










155 




Asn 


Pro 


Glu 


Tyr 

i a c\ 
lou 


Leu 


Thr 


Pro 


Gin 


Gly 
165 


Thr 


Cys 


Ser 


Pro 


KjXTi 


Pro 




x yx 


Val 


Asn 


Gin 


Pro 


Asp 


Val 


Arg 


170 








175 










180 


Pro 


Gin 


Pro 


Pro 


ser 






Glu 


Glv 


Pro 


Leu 


Pro 








loo 










190 






Ala. 


Ala 


Arg 

-HOC 

195 


iri t-> 


Ala 

rix c*. 


Glv 


Ala 


Thr 
200 


Leu 


Glu 


Arg 


Pro 


Lys 


Leu 


Ser 


Pro 


nl 

oiy 




Ann 




Val 


Val 


Lys 


ASD 


205 










^ ± u 










215 


Glu 


Val 


Phe 


Ala 


Phe 

U 


Gly 


Gly 


Ala 


Val 


Glu 
225 


Asn 


Pro 


Tyr 


Leu 


Thr 


Pro 


Gin 


Gly 


Gly 


Ala 


Ala 


Pro 


Gin 


Pro 


230 










235 










240 


His 


Pro 


Pro 


Pro 


Ala 


Phe 


Ser 


Pro 


Ala 


Phe 


Asp 


Asn 








245 










250 






Leu 


Tyr 


Tyr 


Trp 


Asp 


Asp 


Pro 


Pro 


Glu 


Arg 


Gly 


Ala 




255 










260 








Glu 


Pro 


Pro 


Ser 


Thr 


Phe 


Lys 


Gly 


Thr 


Pro 


Thr 


Ala 


260 










270 










275 




Asn 


Pro 


Glu 


Tyr 
280 


Leu 


Gly 


Leu 


Asp 


Val 
285 


Pro 


Val 






(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 



He He Ser Ala Val Val Gly lie Leu Leu Val Val 
15 10 

Val Leu Gly Val Val Phe Gly He Leu He 
15 20 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:22: 



GCCACCATGG CCCCTGACCT CTCCTACATG 
AGTTTCCAGA TGAGGAGGGC GCATGCCAGC 
CAACTGCACC CACTCCTGTG TGGACCTGGA 
TGCCCCGCCG AGCAGAGAGC CAGCCCTCTG 
TCTCTGCGGT GGTTGGCATT CTGCTGGTCG 
GGTGGTCTTT GGGATCCTCA TCAAGCGACG 
AT CACATGT C CAGACCCTGC CCCGGGCGCT 
TCCACCACAG GCACCGCAGC TCATCTACCA 
TGGGG AC CTG ACACTAGGGC TGGAGCCCTC 
GCCCCCAGGT CTCCACTGGC ACCCTCCGAA 
CCGATGTATT TGATGGTGAC CTGGGAATGG 
GGGGCTGCAA AGCCTCCCCA CACATGACCC 
C AG CGGTACA GTGAGGACCC CACAGTACCC 
AGACTGATGG CTACGTTGCC CCCCTGACCT 
GCCTGAATAT GTGAACCAGC CAGATGTTCG 
CCTTCGCCCC G AG AGGGCC C TCTGCCTGCT 
CTGGTGCCAC TCTGGAAAGG CCCAAGACTC 
GAAGAATGGG GTCGTCAAAG ACGTTTTTGC 
GCCGTGGAGA ACCCCGAGAC TTGACACCCC 
TGCCCCTCAG CCCCACCCTC CTCCTGCCTT 
TTCGACAACC TCTATTACTG GGAC CAGGAC 
GGGGGGCTCC ACCCAGCACC TTCAAAGGGA 
AGAGAACCCA GAGTACCTGG GTCTGGACGT 
AGCCTTAAGG GCCATATGGT GAGTGGATGC 
GGCGGGGATG GGGGAGACCT GTAGTCAGAG 
GCACAGGCCA ATGCCCGTCC TTCCCCTGCA 
ACTGCCCGGG TGGGATCCCT GTGACCCCTC 
CTCCTGGCCC TGGAAGTTGC CACTCCAGTG 
TTGTCCTAAT AAAATTAAGT TG CAT CATTT 
GGTGTCCTCT ATAATATTAT AAGCTTGATA 
TCTCAACGTA ACACTTTACA GCGGCGCGTC 
ATGCGCCCCG CTTCCCG AT A AGGGAGCAGG 
CATTACCCGT GGTGGGGTTC CCGAGCGGCC 
GACTCTAAAT CTGCCGTCAT CGACTTCGAA 
CTTCCCCCAC CACCATCACT TTCAAAAGTC 
CCTGCAGCCC GTGTAGCCGT AGTTAGGCCA 
AACTCTGTAG CACCGCCTAC ATACCTCGCT 
TGTTACCAGT GGCTGCTGCC AGTGGCGATA 
TACCGGGTTG GACTCAAGAC GATAGTTACC 
CAGCGGTCGG GCTGAACGGG GGGTTCGTGC 
GCTTGGAGCG AACGACCTAC ACCGAACTGA 
GCGTGAGCAT TGAGAAAGCG CCACGCTTCC 
AAGG CGGACA GGTATCCGGT AAGCGGCAGG 
GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA 
TCTTTATAGT CCTGTCGGGT TTCGCCACCT 
CGTCGATTTT TGTGATGCTC GTCAGGGGGG 
GGAAAAACGC CAGCAACGCG GCCGGGGGAT 
CACTCTAGAT GAGAGAGCAG TGAGGGAGAG 
GAATTTCCGG AG CTATTTCA GTTTTCTTTT 



r^rr* a TPTrr a. 


40 


CI IbLLLLAl 


80 


t*r* a a ta c* mrr* 


120 


TV /■■»/" ,r P/■" , f" , AT/* 1 A 


160 


TGGTL. I 1 vjVjvj 


200 


GCAGCAGAAC? 


240 


GGGGGCATGG 


280 


GGAGTGGCGG 


*aon 
j \j 


mo TV TV TV Z" 1 /T TV 

TGAAGAGGAG 


360 


GGGGCTGGCT 


400 

*± \j \j 


GGGCAGCCAA 


4.40 


CAGCCCTCTA 


4R0 

"X O \J 


CTGCCCTCTG 


con 

™) ^ V/ 


GCAGCCCCCA 


\J \J 


GCCCCAGCCC 


D U \J 


GCCCGACCTG 


640 


fr*/'""r'^/***o/^ a t~*r* 
TCTCCCL-AvjCj 


680 


CTTTGGGGG 1 


720 


AGCiG ALjVj A<j L 


760 


C At? L. v., CAVj U U 


800 


L. UftL VJAvj *— 


840 




880 




920 


v^, X J. W^W- V- V— 


960 


l^. V_ v_ v^, v^VjVjvj V^fl 


1000 


HTGAGTAGTG 


1040 


CCCAGTGCCT 


1080 


CCCACCAGCC 


1120 


TGTCTGACTA 


1160 


TCGAATTCTT 


1200 


ATTTGATATG 


1240 


CCAGTAAAAG 


1280 


AAAGGGAGCA 


1320 


GGTTCGAATC 


1360 


CGAAAGAATT 


1400 


CCACTTCAAG 


1440 


CTG CTAATC C 


1480 


AGTCGTGTCT 


1520 


GGATAAGGCG 


1560 


ACACAGCCCA 


1600 


GATACCTACA 


1640 


CGAAGGGAGA 


1680 


GTCGGAACAG 


1720 


ACGCCTGGTA 


1760 


CTGACTTGAG 


1800 


CGGAGCCTAT 


1840 


CCGGAGAGCT 


1880 


ACAGAGACTC 


1920 


CCGTTTTGTG 


1960 
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CAATTTCACT TATGATACCG GCCAATGCTT GGTTGCTATT 2000 

TTGGAAACTC CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA 2 04 0 

TAAAGGGCCA GCCTGAGCTG CAGAGGATTC CTGCAGAGGA 208 0 

TCAAG AC AG C ACGTGGACCT CGCACAGCCT CTCCCACAGG 2120 

TACCT " 



2125 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: oligonucleotide 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 23: 
GTCTGCCACC ATGGCCTACT CCCCTGC 27 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(iii) HYPOTHETICAL: No 

(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:24: 
TTCTTTGGTG ACCTACCTCT TCGGAATTGC CGAGTC 36 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
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(ii) 



MOLECULE TYPE; 



CDNA 



(iii) 



HYPOTHETICAL: 



No 



(iv) 



ANTI -SENSE : 



NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 25: 



ATGGAGG AG C CGCAGTCAGA TCCTAGCGTC GAGCCCCCTC 
TGAGTCAGGA AACATTTTCA GACCTATGGA AACTACTTCC 
TGAAAACAAC GTTCTGTCCC CCTTGCCGTC CCAAGCAATG 
GATGATTTGA TGCTGTCCCC GGACGATATT GAACAATGGT 
TCACTGAAGA CCCAGGTCCA GATGAAGCTC CCAGAATGCC 
AGAGGCTGCT CCCCGCGTGG CCCCTGCACC AGCAGCTCCT 
ACACCGGCGG CCCCTGCACC AGCCCCCTCC TGGCCCCTGT 
CATCTT CTGT CCCTTCCCAG AAAACCTACC AGGGCAGCTA 
CGGTTTCCGT CTGGGCTTCT TGCATTCTGG GACAGCCAAG 
TCTGCCACCA TGGCCTACTC CCCTGCGTCT GTGACTTGCA 
CGTACTCCCC TGCCCTCAAC AAGATGTTTT GCCAACTGGC 
CAAGACCTGC CCTGTGCAGC TGTGGGTTGA TTCCACACCC 
CCGCCCGGCA CCCGCGTCCG CGCCATGGCC ATCTACAAGC 
AGTCACAGCA CATGACGGAG GTTGTGAGGC GCTGCCCCCA 
CCATGAGCGC TGCT C AG ATA GCGATGGTCT GGCCCCTCCT 
CAGCGTCTTA T C CG AGTGG A AGGAAATTTG CGTGTGGAGT 
ATTTGGATGA CAGAAACACT TTTCGACATA GTGTGGTGGT 
GCCCTATGAG CCGCCTGAGG TTGGCTCTGA CTGTACCACC 
ATCCACTACA ACTACATGTG TAACAGTTCC TGCATGGGCG 
GCATGAACCG GAGGCCCATC CTCACCATCA TCACACTGGA 
AGACTCCAGT GGTAATCTAC TGGGACGGAA CAGCTTTGAG 
GTGCGTGTTT GTGCCTGTCC TGGGAGAGAC CGGCGCACAG 
AGGAAGAGAA TCTCCGCAAG AAAGGGGAGC CTCACCACGA 
GCTGCCCCCA GGGAGCACTA AGCGAGCACT GCCCAACAAC 
ACCAGCTCCT CTCCCCAGCC AAAGAAGAAA CCACTGGATG 
GAGAATATTT CACCCTTCAG ATCCGTGGGC GTGAGCGCTT 
CGAGATGTTC TTTGGTGACC TACCTCTTCG GAATTGCCGA 
GTCTTCCGAG AG CTG AATGA GGCCTTGGAA CTCAAGGATG 
C C CAGG CTGG GAAGGAGCCA GGGGGGAGCA GGGCTCACTC 
CAGCCACCTG AAGTCCAAAA AGGGTCAGTC TACCTCCCGC 
CATAAAAAAC TCATGTTCAA GACAGAAGGG CCTGACTCAG 
AC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
640 
680 
720 
760 
800 
840 
880 
920 
960 
1000 
1040 
1080 
1120 
1160 
1200 
1240 
1242 



(A) 
(B) 
<C) 
(D) 



LENGTH: 608 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY : unknown 



(ii) 



MOLECULE TYPE: 



cDNA 



(iii) 



HYPOTHETICAL: 



NO 
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(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26 



CTCGGGCCGC 
CCCTGACGAG 
TGGCGAAACC 
CCCCTGGAAG 
GCCGCTTACC 
AGCGTGGCGC 
GTTCGGTGTA 
CGAACCCCCC 
AACTATCGTC 
CGCCACTGGC 
GAGGTATGTA 
CCTAACTACG 
GCG CTCTGCT 
TAGCTCTTGA 
GGTTTTTTTG 
AAGGATCT 



GTTGCTGGCG 
CATCACAAAA 
CGACAGGACT 
CTCCCTCGTG 
GGATACCTGT 
TTTCT CAATG 
GGTCGTTCGC 
GTTCAGCCCG 
TTGAGTCCAA 
AGCAGCCACT 
GGCGGTG CTA 
GCTACACTAG 
GAAGCCAGTT 
TCCGGCAAAC 
TTTGCAAGCA 



TTTTTCCATA 
ATCGACGCTC 
ATAAAGATAC 
CGCTCTCCTG 
CCGCCTTTCT 
CTCACGCTGT 
TCCAAGCTGG 
ACCGCTGCGC 
CCCGGTAAGA 
GGTAACAGGA 
CAGAGTTCTT 
AAGGACAGTA 
ACCTTCGGAA 
AAACCACCGC 
GCAGATTACG 



GGCTCCGCCC 
AAGTCAGAGG 
CAGGCGTTTC 
TTCCGACCCT 
CCCTTCGGGA 
AGGTATCTCA 
GCTGTGTGCA 
CTTATCCGGT 
CACGACTTAT 
TTAGCAGAGC 
GAAGTGGTGG 
TTTGGTATCT 
AAAGAGTTGG 
TGGTAGCGGT 
CGCAGAAAAA 



40 
80 
120 
160 
200 
240 
280 
320 
360 
400 
440 
480 
520 
560 
600 
608 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
( ii i ) HYPOTHETICAL : No 
(iv) ANTI -SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27: 



GGTACCTGCC ACCATGGCGC GGATTCTTTA 
GTTGGTGGAC ATATTATGTT TAT C AGTGAT 
GCATGACAAA GTTGCAGCCG AATACAGTGA 
CCCTGGACTG TTGAACGAGG TCGGCGTAGA 
ACACGCAAAC TGGCGGAACG GTTGGGGGTG 
CGCTTTACTG GCACTTCAGG AACAAG CGGG 
GCCATATGGT GAGTGGATGC CTTGACCCCA 
GGGGAGACCT GTAGTCAGAG CCCCCGGGCA 
ATGCCCGTCC TTCCCCTGCA GGATGAGTAG 
TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC 
TGTCCTAATA AAATTAAGTT GCAT CATTTT 
GTGTCCTCTA TAATATTATA AGCTTGATAT 
CGGACTTTTG AAAGTGATGG TGGTGGGGGA 
CCTTCGAAGT CGATGACGGC AGATTTAGAG 
TTGGCCGCTC GGGAACCCCA CCACGGGTAA 
GGCCTGCTCC CTTATCGGGA AGCGGGGCGC 
ATGACGCGCC GCTGTAAAGT GTTACGTTGA 
CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC 
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TCACTGATAA 40 

AAAGTGTCAA 80 

TCCGTGCCGG 120 

CGGTCTGACG 160 

CAGCAGCCGG 2 00 

CGCCTTAAGG 240 

GGCGGGGATG 280 

GCACAGGCCA 320 

TGAGTGCCTC 3 60 

CCACCAGCCT 4 00 

GTCTGACTAG 440 

CGAATTCTTT 4 80 

AGGATTCGAA 520 

TCTGCTCCCT 560 

TGCTTTTACT 600 

ATCATATCAA 640 

GAAAGAATTC 680 

CATAGGCTCC 720 
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GCCCCCCTGA CG AG CAT CAC AAAAATCGAC 

GAGGTGGCGA AACCCGACAG GACTATAAAG 

TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT 

CCCTGCCGCT TACCGGATAC CTGTCCGCCT 

GGGAAGCGTG GCGCTTTCTC AATGCTCACG 

CTCAGTT CGG TGTAGGTCGT TCGCTCCAAG 

TGCACGAACC CCCCGTTCAG CCCGACCGCT 

CGGTAACTAT CGTCTTGAGT CCAACCCGGT 

TTATCGCCAC TGGCAGCAGC CACTGGTAAC 

GAG CGAGGTA TGTAGGCGGT GCTACAGAGT 

GTGGCCTAAC TACGGCTACA CTAGAAGGAC 

ATCTGCGCTC TGCTGAAGCC AGTTACCTTC 

TTGGTAGCTC TTGATCCGGC AAACAAACCA 

CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT 

AAAAAAGGAT CTGGGGG AT C CGGAGAGCTC 

AGAGAGCAGT GAGGGAGAGA CAGAGACTCG 

GCTATTTCAG TTTTCTTTTC CGTTTTGTGC 

ATGATACCGG CCAATGCTTG GTTGCTATTT 

CCTTAGGGGA TGCCCCTCAA CTGGCCCTAT 

CCTGAGCTGC AGAGGATTCC TGCAGAGGAT 

CGTGGACCTC GCACAGCCTC TCCCACA 



GCTCAAGTCA 760 

ATACCAGGCG 800 

CCTGTTCCGA 84 0 

TTCTCCCTTC 880 

CTGTAGGTAT 920 

CTGGGCTGTG 960 

GCGCCTTATC 1000 

AAGACACGAC 104 0 

AGGATTAGCA 1080 

TCTTGAAGTG 1120 

AGTATTTGGT 116 0 

GGAAAAAGAG 1200 

CCGCTGGTAG 124 0 

TACGCGCAGA 1280 

ACTCTAGATG 1320 

AATTTCCGGA 1360 

AATTTCACTT 14 00 

TGGAAACTCC 1440 

AAAGGGCCAG 1480 

CAAGACAGCA 1520 

1547 



(2) I NFORKAT I ON FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 
( i i i ) HYPOTHETICAL : No 
(iv) ANT I - SENSE : No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 28: 



GGTACCTGCC ACCATGGCGC GGATTCTTTA 
GTTGGTGGAC ATATTATGTT TATCAGTGAT 
GCATGACAAA GTTGCAGCCG AATACAGTGA 
CCCTGGACTG TTGAACGAGG TCGGCGTAGA 
ACACGCAAAC TGGCGGAACG GTTGGGGGTG 
CGCTTTACTG GCACTTCAGG AACAAGCGGG 
GCCATATGGT GAGTGGATGC CTTGACCCCA 
GGGGAGACCT GTAGT CAG AG CCCCCGGGCA 
ATGCCCGTCC TTCCCCTGCA GGATGAGTAG 
TCCTGGCCCT GGAAGTTGCC ACTCCAGTGC 
TGTCCTAATA AAATTAAGTT GCATCATTTT 
GTGTCCTCTA TAATATTATA AGCTTGATAT 
CGGACTTTTG AAAGTGATGG TGGTGGGGGA 
CCTTCGAAGT CGATGACGGC AGATTTAGAG 
TTGGCCGCTC GGGAACCCCA CCACGGGTAA 
GGCCTGCTCC CTTATCGGGA AGCGGGGCGC 
ATGACGCGCC GCTGTAAAGT GTTACGTTGA 



TCACTGATAA 40 

AAAGTGTCAA 80 

TCCGTGCCGG 120 

CGGTCTGACG 160 

CAG CAG C CGG 2 00 

CGCCTTAAGG 240 

GGCGGGGATG 280 

GCACAGGCCA 320 

TGAGTGCCTC 360 

CCACCAGCCT 4 00 

GTCTGACTAG 440 

CGAATTCTTT 4 80 

AGGATTCGAA 520 

TCTGCTCCCT 560 

TGCTTTTACT 600 

ATCATATCAA 640 

GAAAGAATTC 680 
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CTGCAGCCCG CCGCGTTGCT GGCGTTTTTC 
GCCCCCCTGA CGAGCATCAC AAAAATCGAC 
GAGGTGGCGA AACCCGACAG GACTATAAAG 
TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT 
CCCTGCCGCT TACCGGATAC CTGTCCGCCT 
GGGAAGCGTG GCGCTTTCTC AATGCTCACG 
CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG 
TGCACGAACC CCCCGTTCAG CCCGACCGCT 
CGGTAACTAT CGTCTTGAGT CCAACCCGGT 
TTATCGCCAC TGGCAGCAGC CACTGGTAAC 
GAGCGAGGTA TGTAGGCGGT GCTACAGAGT 
GTGGCCTAAC TACGGCTACA CTAGAAGGAC 
ATCTGCGCTC TGCTGAAGCC AGTTACCTTC 
TTGGTAGCTC TTGATCCGGC AAACAAACCA 
CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT 
AAAAAAGGAT CTGGGGGATC CGGAGAGCTC 
GGATGCATGG ATGAGGGAAA GGAGGTAAGA 
ATAAGCAGGA ACTTTGAAGA CTCAGTGACT 
TAAAGACTCA GTGACTTCTG ATCCTGTCCT 
CCTTGTTGTC CCAAGAAAGC GGCTTCCTGC 
GGACCCCTTC CCTGGAAGGT AAAACTAAGG 
AGAAATTTTT CCACCATTGG TGCTTGGTCA 
TGATGAGCTC ACTCTAGATG AGAGAGCAGT 
CAGAGACTCG AATTTCCGGA GCTATTTCAG 
CGTTTTGTGC AATTTCACTT ATGATACCGG 
GTTGCTATTT TGGAAACTCC CCTTAGGGGA 
CTGGCCCTAT AAAGGGCCAG CCTGAGCTGC 
TG C AG AGG AT CAAGACAGCA CGTGGACCTC 
TCCCACA 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: No 

(iv) ANTI- SENSE: No 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 



CATAGGCTCC 720 

GCTCAAGTCA 76 0 

AT ACCAGG CG 800 

CCTGTTCCGA 84 0 

TTCTCCCTTC 880 

CTGTAGGTAT 92 0 

CTGGGCTGTG 960 

GCGCCTTATC 1000 

AAGACACGAC 104 0 

AGGATTAGCA 1080 

TCTTGAAGTG 112 0 

AGTATTTGGT 116 0 

GGAAAAAGAG 1200 

CCGCTGGTAG 124 0 

TACGCGCAGA 12 80 

CCAACGCGTT 1320 

TCTGTAATGA 13 6 0 

CAGTGAGTAA 14 00 

AACTGCCACT 144 0 

TCTCTGAGGA 14 80 

ATGTCAGCAG 1520 

AAGAGGAAAC 1560 

G AGGG AGAG A 16 00 

TTTTCTTTT C 164 0 

CCAATGCTTG 1680 

TGCCCCTCAA 172 0 

AGAGGATTCC 1760 

GCACAGCCTC 1800 

1807 



GGTACCTGCC ACCATGGCGA AGGGCGAGGA 
GGCGTGGTCC CAATTCTCGT GGAACTGGAT 
ATGGGCACAA ATTTTCTGTC AGCGGAGAGG 
TGCCACATAC GGAAAGCTCA CCCTGAAATT 
ACTGGAAAGC TCCCTGTGCC ATGGC CAACA 
CCTTCACCTA TGGCGTGCAG TGCTTTTCCA 
CCATATGAAG CAGCATGACT TTTTCAAGAG 



ACTGTTCACT 40 

GGCGATGTGA 80 

GTGAAGGTGA 120 

CATCTGCACC 160 

CTGGTCACTA 200 

GATACCCAGA 240 

CGCCATGCCC 280 
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GAGGGCTATG TGCAGGAGAG AACCATCTTT 
ACGGGAACTA CAAGACCCGC GCTGAAGTCA 
TGACACCCTG GTGAATAGAA TCGAGTTGAA 
TTTAAGGAAG ATGGAAACAT TCTCGGCCAC 
ACAACTATAA CTCCCACAAT GTGTACATCA 
GCAAAAGAAT GG CAT CAAGG TCAACTTCAA 
AACATTGAGG ATGGATCCGT GCAGCTGGCC 
AACAGAACAC TCCAATCGGC GACGGCCCTG 
AGACAACCAT TACCTGTCCA CCCAGTCTGC 
GATCCCAACG AAAAGAGAGA CCACATGGTC 
TTGTGACCGC TGCTGGGATC ACACATGGCA 
GTACAAGTGA GCGCCTTAAG GGCCATATGG 
CCTTGACCCC AGG CGGGG AT GGGGGAGACC 
GCCCCCGGGC AGCACAGGCC AATGCCCGTC 
AGGATGAGTA GTGAGTGCCT CTCCTGGCCC 
CACTCCAGTG CCCACCAGCC TTGTCCTAAT 
TGGATCATTT TGTCTGACTA GGTGTCCTCT 
AAGCTTGATA TCGAATTCTT TCGGACTTTT 
GTGGTGGGGG AAGGATTCGA ACCTTCGAAG 
CAGATTTAGA GTCTGCTCCC TTTGGCCGCT 
ACCACGGGTA ATGCTTTTAC TGGCCTGCTC 
AAGCGGGGCG CATCATATCA AATGACGCGC 
TGTTACGTTG AGAAAGAATT CCTGCAGCCC 
TGGCGTTTTT CCATAGGCTC CGCCCCCCTG 
CAAAAATCGA CGCTCAAGTC AGAGGTGGCG 
GGACTATAAA GATACCAGGC GTTTCCCCCT 
TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC 
CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT 
CAATGCTCAC GCTGTAGGTA TCTCAGTTCG 
TTCGCTCCAA GCTGGGCTGT GTGCACGAAC 
GCCCGACCGC TGCGCCTTAT CCGGTAACTA 
TCCAACCCGG TAAGACACGA CTTATCGCCA 
CCACTGGTAA CAGGATTAGC AGAGCGAGGT 
TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 
ACTAGAAGGA CAGTATTTGG TATCTGCGCT 
CAGTTACCTT CGGAAAAAGA GTTGGTAGCT 
CAAACAAACC ACCGCTGGTA GCGGTGGTTT 
AAG C AG C AG A TTACGCGCAG AAAAAAAGGA 
CCGGAGAGCT CCCAACGCGT TGGATGCATG 
AGGAGGTAAG ATCTGTAATG AATAAGCAGG 
ACTCAGTGAC TCAGTGAGTA ATAAAGACTC 
GATCCTGTCC TAACTGCCAC TCCTTGTTGT 
CGGCTTCCTG CTCTCTGAGG AGGACCCCTT 
TAAAACTAAG GATGTCAGCA GAGAAATTTT 
GTGCTTGGTC AAAGAGGAAA CTGATGAGCT 
GAGAGAGCAG TGAGGGAGAG ACAGAGACTC 
AGCTATTTCA GTTTTCTTTT CCGTTTTGTG 
TATGATACCG GCCAATGCTT GGTTGCTATT 
CCCTTAGGGG ATGCCCCTCA ACTGGCCCTA 
GCCTGAGCTG CAGAGGATTC CTGCAGAGGA 
ACGTGGACCT CGCACAGCCT CTCCCACA 



TTCAAAGATG 


320 


AGTTCGAAGG 


360 


GGGCATTGAC 


400 


AAGCTGGAAT 


440 


TGGCCGACAA 


480 


GATCAGACAC 


520 


GACCATTATC 


560 


TGCTCCTCCC 


600 


CCTGTCTAAA 


640 


CTGCTGGAGT 


680 


TGGACGAGCT 


720 


TGAGTGGATG 


760 


TGTAGTCAGA 


800 


CTTCCCCTGC 


840 


TGGAAGTTGC 


880 


AAAATTAAGT 


920 


ATAATATTAT 


960 


GAAAGTGATG 


1000 


TCGATGACGG 


1040 


CGGGAACCCC 


1080 


CCTTATCGGG 


1120 


CGCTGTAAAG 


1160 


GCCGCGTTGC 


1200 


ACG AG CATCA 


1240 


AAACCCGACA 


1280 


GGAAGCTCCC 


1320 


TTACCGGATA 


1360 


GGCGCTTTCT 


1400 


GTGTAGGTCG 


1440 


CCCCCGTTCA 


1480 


TCGTCTTGAG 


1520 


CTGGCAGCAG 


1560 


ATGTAGGCGG 


1600 


CTACGGCTAC 


1640 


CTGCTGAAGC 


1680 


CTTGATCCGG 


1720 


TTTTGTTTGC 


1760 


TCTGGGGGAT 


1800 


GATGAGGGAA 


1840 


AACTTTGAAG 


1880 


AGTGACTTCT 


1920 


CCCAAGAAAG 


1960 


CCCTGGAAGG 


2000 


TCCACCATTG 


2040 


CACTCTAGAT 


2080 


GAATTTCCGG 


2120 


CAATTTCACT 


2160 


TTGGAAACTC 


2200 


TAAAGGGCCA 


2240 


TCAAGACAGC 


2280 




2308 



448026 1 



